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,— 1 ' Abstract 

t-h ; 

\ Winfree's abstract Tile Assembly Model (aTAM) is a model of molecular self-assembly of 

■ DNA complexes known as tiles, which float freely in solution and attach one at a time to a 
?H \ growing "seed" assembly based on specific binding sites on their four sides. We show that 

■ there is a polynomial-time algorithm that, given an n x n square, finds the minimal tile system 
\ (i-G-j the system with the smallest number of distinct tile types) that uniquely self-assembles 

■ the square, answering an open question of Adlcman, Cheng, Goel, Huang, Kempe, Moisset 
de Espancs, and Rothcmund (Combinatorial Optimization Problems in Self-Assembly, STOC 
2002). Our investigation leading to this algorithm reveals other positive and negative results 

>-/ ' about the relationship between the size of a tile system and its "temperature" (the binding 

strength threshold required for a tile to attach). 

O . 

1 Introduction 

(N ■ 

' Tile self-assembly is an algorithmically rich model of "programmable crystal growth" . It is possible 

^ ■ to design monomers (square- like "tiles") with specific binding sites so that, even subject to the 

t^J- . chaotic nature of molecules floating randomly in a well-mixed chemical soup, they are guaranteed 

to bind so as to deterministically form a single target shape. This is despite the number of different 
7— I ■ types of tiles possibly being much smaller than the size of the shape and therefore having only 

"local information" to guide their attachment. The ability to control nanoscale structures and 
| machines to atomic-level precision will rely crucially on sophisticated self-assembling systems that 

automatically control their own behavior where no top-down externally controlled device could fit. 

■ A practical implementation of self-assembling molecular tiles was proved experimentally fea- 
sible in 1982 by Seeman [34] using DNA complexes formed from artificially synthesized strands. 
Experimental advances have delivered increasingly reliable assembly of algorithmic DNA tiles with 
error rates of 10% per tile in 2004 [32], 1.4% in 2007 [17], 0.13% in 2009 [7], and 0.05% in 2010 [16]. 
Erik Winfree [41] introduced the abstract Tile Assembly Model (aTAM) - based on a constructive 
version of Wang tiling [39,40] - as a simplified mathematical model of self-assembling DNA tiles. 
Winfree demonstrated the computational universality of the aTAM by showing how to simulate 
an arbitrary cellular automaton with a tile assembly system. Building on these connections to 
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computability, Rothemund and Winfree [31] investigated a self-assembly resource bound known 
as tile complexity, the minimum number of tile types needed to assemble a shape. They showed 
that for most n, the problem of assembling an n x n square has tile complexity r2( lo 1 °^ n ), and 
Adleman, Cheng, Goel, and Huang [3] exhibited a construction showing that this lower bound is 
asymptotically tight. Under natural generalizations of the model [1,6,8,10-13,25,26,29,36,37], tile 
complexity can be reduced for the assembly of squares and more general shapes. 

The results of this paper are motivated by the following problem posed in 2002. Adleman, 
Cheng, Goel, Huang, Kempe, Moisset de Espanes, and Rothemund [5] showed that there is a 
polynomial time algorithm for finding a minimum size tile system (i.e., system with the smallest 
number of distinct tile types) to uniquely self-assemble a given n x n square, 1 subject to the 
constraint that the tile system's "temperature" (binding strength threshold required for a tile to 
attach) is 2. They asked whether their algorithm could be modified to remove the temperature 
2 constraint. Our main theorem answers this question affirmatively. Their algorithm works by 
brute- force search over the set of all temperature-2 tile systems with at most 0( ^°f^ n ) tile types, 
using the fact proven by Adleman, Cheng, Goel, and Huang [3] that such an upper bound on tile 
complexity suffices to assemble any n x n square. A simple counting argument shows that for 
any constant r, the number of tile systems with glue strengths and temperature at most r and 

log Log n ) types is bounded by a polynomial in n. One conceivable approach to extending 
the algorithm to arbitrary temperature is to prove that for any tile system with K tile types, the 
strengths and temperature can be re-assigned so that they are upper-bounded by a constant or 
slow-growing function of K, without affecting the behavior of the tile system. 2 However, we show 
that this approach cannot work, by demonstrating that for each K, there is a tile system with K 
tile types whose behavior cannot be preserved using any temperature less than 2 K f A . The proof 
crucially uses 3- cooperative binding, meaning attachment events that require three different glues 
of a tile to match the assembly. On the other hand, we show that any 2-cooperative tile system 
(which captures a wide and useful class of systems) with K tile types is behaviorally equivalent to 
a system with temperature at most 2K + 2. 

Of course, the choice of integer glue strengths is an artifact of the model. Nonetheless, our 
investigation does reflect fundamental questions about how finely divided molecular binding energies 
must be in a real molecular self-assembly system. The requirement of integer strengths is simply 
one way of "quantizing" the minimum distinction we are willing to make between energies and then 
re-scaling so that this quantity is normalized to l. 3 Our 3-cooperative lower bound therefore shows 
that in general, certain self- assembling systems that have very large gaps between some of their 
binding energies nonetheless require other binding energies to be extremely close (exponentially 
small in terms of the larger gaps) and yet still unequal. This can be interpreted as an infeasibility 
result if one defines "exponentially fine control" of binding energies as "infeasible" to execute in 
any real laboratory, since no implementation of the specified tile behavior can use courser energies. 

As a converse to the exponential temperature lower bound stated above, we show that there 
is a polynomial-time algorithm that, given any tile system T with K tile types specified by its 
desired binding behavior, finds a temperature and glue strengths at most 2°^ that implement 

The square is encoded by a list of its points, so the algorithm's running time is polynomial in n. 

2 We define "behavior" more formally in Section 3. Briefly, we consider a tile system's behavior unaltered by a 
reassignment of strengths and temperature if, for each tile type t, the reassignment has not altered the collection of 
subsets of sides of t that have sufficient strength to bind. 

3 Indeed, our proof does not require that strengths be integer, merely that the distance between the smallest energy 
strong enough to bind and the largest energy too weak to bind be at least 1. 
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this behavior or reports that no such strengths exist. This algorithm is used to show our main result, 
that there is a polynomial-time algorithm that, given an n x n square, determines the smallest tile 
assembly system (at any temperature) that uniquely self-assembles the square, answering the open 
question of [5]. 

2 Abstract Tile Assembly Model 

This section gives a brief informal sketch of the abstract Tile Assembly Model (aTAM). See Section 
A for a formal definition of the aTAM. 

A tile type is a unit square with four sides, each having a glue label (often represented as a 
finite string). We assume a finite set T of tile types, but an infinite number of copies of each tile 
type, each copy referred to as a tile. An assembly (a.k.a., supertile) is a positioning of tiles on 
(part of) the integer lattice Z 2 ; i.e., a partial function I? — » T. For a set of tile types T, let 
A(T) denote the set of all glue labels of tile types in T. We may think of a tile type as a function 
t : {N,S, E, W} — > A(T) indicating, for each direction d G {N,S, E,W} ("north, south, east, west"), 
the glue label t(d) appearing on side d. A strength function is a function g : A(T) — > N indicating, 
for each glue label i, the strength g(£) with which it binds. Two adjacent tiles in an assembly 
interact if the glue labels on their abutting sides are equal and have positive strength according 
to g. Each assembly induces a binding graph, a grid graph whose vertices are tiles, with an edge 
between two tiles if they interact. The assembly is r-stable if every cut of its binding graph has 
strength at least r, where the weight of an edge is the strength of the glue it represents. That is, 
the assembly is stable if at least energy r is required to separate the assembly into two parts. 

A tile assembly system (TAS) is a quadruple T = (T, a, g, r), where T is a finite set of tile types, 
a : 1? —■* T is a finite, r-stable seed assembly, g : A(T) — > N is a strength function, and r is a 
temperature. In this paper, we assume that all seed assemblies a consist of a single tile type (i.e., 
|dom o~\ = 1). Given a TAS T = (T, a, g, r), an assembly a is producible if either a = a or if ft is a 
producible assembly and a can be obtained from ft by placing a single tile type t on empty space (a 
position p£Z 2 such that ft(p) is undefined), such that the resulting assembly a is r-stable. In this 
case write ft — >\ a (a is producible from ft by the stable attachment of one tile), and write ft — > a 
if ft — >* a (a is producible from ft by the stable attachment of zero or more tiles). An assembly is 
terminal if no tile can be stably attached to it. Let A[T] be the set of producible assemblies of T, 
and let ^4n[T] Q -A[T] be the set of producible, terminal assemblies of T ■ Given a connected shape 
S C I? , a TAS T uniquely self- assembles S if .An[T] = {3} and dom a = S. 

3 Finding Strengths to Implement a Tile System 

In this section we show that there is a polynomial-time algorithm that, given a desired behavior of 
a tile system, can find strengths to implement that behavior that are at most exponential in the 
number of tile types, or report that no such strengths exist. This algorithm is the main technical 
tool used in the proof of our main result, Theorem 4.2. 

First, we formalize what we mean by the "behavior" of a tile system. Let T be a set of tile types, 
and let t G T. Given a strength function g : A(T) — > N and a temperature r G Z + , define the cooper- 
ation set oft with respect to g and r to be the collection V g%T {t) = { D C {N, S, E, W} | J2 deD g(t(d)) > r }, 
i.e., the collection of subsets of sides of t whose glues have sufficient strength to bind cooperatively. 
Let a : 1? — -> T be a seed assembly, let t\,ti G Z + be temperatures, and let gi,g2 '■ A(T) — > N 
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be strength functions. We say that the TAS's T\ = (T,a,g\,Ti) and T 2 = (T,a,g2,T2), differing 
only on their strength function and temperature, are locally equivalent if, for each tile type t £ T, 
~Dgi,Ti(t) = ^g2,T2 (*)- 4 The behavior of an individual tile type during assembly is completely deter- 
mined by its cooperation set, in the sense that if 71 and T 2 are locally equivalent, then A[T\] = A[T 2 ] 
and Aa[Ti] = A D [T 2 }. 5 

Even without any strength function or temperature, by specifying a cooperation set for each 
tile type, one can describe a "behavior" of a TAS in the sense that its dynamic evolution can be 
simulated knowing only the cooperation set of each tile type. We call a system thus specified a 
strength-free TAS. More formally, a strength-free TAS is a triple (T,a,T>), where T is a finite set 
of tile types, a : I? — - > T is the size-1 seed assembly, and P:T4 V(V({N, S, E, W})) is a function 
from a tile type t G T to a cooperation set V(t). For a standard TAS T = (T,a,g,r) and a 
strength-free TAS 7sf = (T, a, T>) sharing the same tile set and seed assembly, we say that T and 
%f are locally equivalent if T> g ^ T (t) = T>(t) for each tile type t € T. 

Note that every TAS has a unique cooperation set for each tile type, and hence, has a locally 
equivalent strength-free TAS. Say that a strength-free TAS is implementable if there exists a TAS 
locally equivalent to it. Not every strength-free TAS is implementable. This is because cooperation 
sets could be contradictory; for instance, two tile types t\ and t 2 could satisfy ii (N) = t 2 {Wj and 
{N} € V(t x ) but {N} &V(t 2 ). 

Theorem 3.1. There is a polynomial time algorithm that, given a strength-free TAS%f = (T,o~,T>), 
determines whether there exists a locally equivalent TAS T = (T, a, g, r) and outputs such a T if it 
exists. Furthermore, it is guaranteed that r < 2°^ T ^ . 

Proof. Intuitively, we count the number of different total behaviors a tile can have, based on 
equating its behavior with its cooperation set. We then cast the problem of finding strengths 
to implement a given strength-free tile system as finding solutions to a certain system of linear 
inequalities, which is solved by Gaussian elimination. We then argue that the vertices of the 
polytope defined by this system have rational numbers with numerator and denominator that 
are at most exponential in the number of inequalities, which is itself linear in the number of tile 
types. This implies that by multiplying the rational numbers at one of these vertices by their least 
common multiple, which preserves the inequalities, we obtain an integer solution with values at 
most exponential in the number of tile types. 

Formally, let %f be a strength-free TAS with a tile set T of k tile types with u < k + 1 different 
glues. 6 We would like to decide whether 7^ is in fact realizable by a TAS. To have the tightest 
upper bound on the temperature, ideally we could to solve the problem of finding the minimum 
temperature TAS that is locally equivalent to %{. This optimization problem can be cast as an 
integer linear program on a temperature variable r and a set of glue-strength variables s\, s 2 , . . . , s u 
as in the following example: 

4 Note that the definition of equivalence is independent of the seed assembly; we include it only to be able to talk 
about the equivalence of TAS's rather than the more cumbersome "equivalence of triples of the form (T, g, r)." 

5 The converse does not hold, however. For instance, some tile types may have a subset of sides whose glues never 
appear together at a binding site during assembly, so it would be irrelevant to the definition of A[T] and .Arj[T] 
whether or not that combination of glues have enough strength to bind. 

6 Each tile type has 4 sides so it might seem that there could be 4fc total glues if there are k tile types. However, 
in a nontrivial system (one that has no "effectively null" glues that appear on only one side of any tile type), for each 
side of a tile type, the choice of glue for that side is limited to those glues on the opposite side of the k tile types, or 
alternately we could choose the null strength-0 glue. 
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Minimize r 

subject to t, s±, s 2 , • • • , s u £ N 

H + S3 - t > 

si + s 4 + s 6 - r > 

si + s 2 + s 4 - r < -1 
s 2 + 2s 3 -r < -1 

The "> 0" inequalities correspond to the union of all cooperation sets of all tile types, and the 
"< —1" inequalities correspond to the union of all complements of the cooperation sets; i.e., each 
set "P({N, S, E, W}) \ T>(t), where T>(t) is the cooperation set of t. Since we require each strength to 
be an integer, "< —1" is equivalent to "< 0". Since each tile type t has |2?(t)| "> 0" inequalities 
(one for each subset of sides in its cooperation set) and 16 — "< —1" inequalities, there are 

16k inequalities in the integer linear program. 

However, solving this optimization problem is not necessary to prove the theorem. Our goal will 
not be to find the smallest temperature TAS that satisfies the constraints above (which remains 
an open problem to do in polynomial time), but simply to find any feasible integer solution with 
temperature and strengths at most 2°( k \ Call the above system of constraints (including the in- 
teger constraint) S\. Consider the real- valued system of linear inequalities S2 defined as the above 
inequalities with the integer constraint r, s±, s 2 , • • • , s u £ N relaxed to simply r, si, S2, ■ ■ ■ , s u > 0. 
Then we have the implication "Si has a solution" ==>• "S2 has a solution". Conversely, any 
rational-valued solution to S2 can be converted to an integer-valued solution to Si by multiplying 
each value by the least common multiple L of the denominators of the rational numbers. 7 Further- 
more, since the input coefficients are integers, if the feasible polytope of S2 is non-empty, then all of 
its vertices are rational. Therefore S2 has a solution if and only if it has an integer solution. Since 
any integer solution to S2 is a solution to Si, we have the full bidirectional implication "Si has a 
solution" <4=> "S2 has an integer solution" . We can pick any n linearly independent inequalities 
of S2, interpret them as equalities, and use Gaussian elimination (with exact rational arithmetic) 
to obtain some vertex of the feasible polytope described by the inequalities, and convert these to 
integer solutions to Si through multiplication as described above. If we cannot find n linearly 
independent inequalities (testable by computing the rank of the matrix defining the inequalities) 
then there is no TAS implementing the behavior of %f. It remains to show that in case there is a 
solution, the integers we obtain by this method obey the stated upper bound 2°( fc ). 

Each coefficient has absolute value at most 2 (since we may assume N/S glues are disjoint from 
E/W glues), and each equation has at most 5 nonzero left side terms since each tile type has only 
4 sides (together with the —1 coefficient for r). Applying Lemma 3.2 (stated and proven after the 
current proof) with n = u + 1, c\ = 2, and C2 = 5, we obtain that each vertex is a rational vector 
x = (£i,...,gi) such that, for each 1<»<« + 1, \pi\,\qi\ < 2 u+l 6^ u+l ^ 2 = 2«+ 1 +((«+ 1 )/ 2 )- lo s 6 < 

2 3u+3 . Since we enforce nonnegativity, pi = \pi\ and qi = \qi\. We multiply the rational vector x 
by L = LCM(gi, . . . , q u +i) to obtain the integer vector x' . By Lemma 3.2, L < 2 3u+3 . Then each 
integer solution value x\ obeys x • = L ■ x { < (2 3 "+ 3 ) 2 < 2 2 ( 3 ( fc+1 )+ 3 ) = 2 oi - k \ □ 



7 This actually enforces the stronger condition that each "< —1" inequality is actually "< — L" . This is possible 
because we have no upper bound on the variables, which would prevent multiplication from necessarily preserving 
the inequalities. 
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Lemma 3.2. Let c±, C2 £ Z + be constants, and let b be annxl integer column vector and A = (a^) 
be a nonsingular n x n integer matrix such that for each \aij\ < c\ and \bj\ < c\, and each row of 
A contains at most 02 nonzero entries. Then the solution to the linear system Ax = b is a rational 
vector x £ Q n such that, if each component X{ = — is written in lowest terms with Pi,qi £ Z, then 

\p-i\ < c™(c2 + l) n / 2 and \qi\ < c^c^ 2 . Furthermore, the least common multiple of all the 's is at 
most CjC2 . 

Proof Recall Hadamard's inequality | det j4| < fJiLi ll^ilb, where V{ is the i th row of A. 8 Since 
Vi has at most C2 nonzero entries that are each at most absolute value c±, Hadamard's inequality 
tells us that |detA| < YYi=i V c z ' c i = c i c 2^ 2 - Similarly, letting Ai be A with column vector b 
replacing A's 2 th column, Ai has at most C2 + 1 nonzero entries per row, so a similar argument 
gives | detail < c™(c2 + l) n / 2 . The i th solution is Xi = by Cramer's rule. Since A and A4 are 

integer- valued, so are det A and det A+, whence the upper bounds on | det A\ and | det AA also apply 
to \qi\ and \pi\, respectively, since they are Xj's lowest terms representation. Since the lowest terms 
representation of each q^ necessarily divides det A, this implies that their least common multiple is 
also at most c^c^ 2 . □ 

4 Finding the Minimum Tile Assembly System Assembling a Square 
at any Temperature 

This section proves the main result of this paper. We show that there is a polynomial-time al- 
gorithm that, given an n x n square S n , computes the smallest TAS that uniquely self-assembles 
S n . Adleman, Cheng, Goel, Huang, Kempe, Moisset de Espanes, and Rothemund [5] showed that 
this problem is polynomial-time solvable when the temperature is restricted to be 2, and asked 
whether there is an algorithm that works when the temperature is unrestricted, which we answer 
affirmatively 

The next proposition is useful in the proof of our main theorem. 

Proposition 4.1. For each k G Z + , there are at most 168 fc /c 4fc+2 implementable strength-free TAS's 
with at most k tile types. 

Proof. If T = (T, a, T>) is an implementable strength-free TAS, then for all t £ T, the cooperation 
set T>{t) of t is a collection of subsets of {N,S, E, W} that is closed under the superset operation, 
i.e., if D C D' C {N,S, E, W} and D £ T>(t), then D' £ V(t). This closure property is due to the 
fact that having strictly more sides available to bind cannot inhibit binding that would otherwise 
occur, as long as strengths are assumed to be nonnegative. Each cooperation set T> is defined by a 
unique antichain, which is a subcollection V = {D[, . . . ,D' m } C T> such that, for all 1 < i,j < m, 
D[ % Dj, whose closure under the superset operation is equal to T>. The antichain consists of 
the minimal elements of T> under the partial order C. The number of antichains of subsets of 
{N,S,E,W} is given by the fourth Dedekind number M(4) = 168 [35]. Thus, a tile type has at 
most 168 different possible cooperation sets. 

For each side of a tile type, there are at most k glue labels to choose, so there are at most k 4 ways 
to assign these labels to each side. Therefore, encoding each tile type as a list of 4 glue labels and 

8 Hadamard's inequality is typically stated for Vi a column of A, but the determinant of a matrix and its transpose 
are equal so the bound holds when taking the product over rows as well. 
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cooperation set, and encoding a TAS as a list of tile types, there are at most (168fc 4 ) fc = 168 k k 4k 
different strength-free tile sets with k tile types. Since there are k choices for the seed tile, there 
are at most 168 fc fe 4fc+1 different strength-free TAS's with k tile types. Thus there are at most 
£*Li 168H U+1 < 168 fc /c 4fc+2 such TAS's with at most k tile types. □ 

The following theorem is the main result of this paper. 

Theorem 4.2. There is a polynomial-time algorithm that, given an n x n square S n , outputs a 
minimal TAS T that uniquely self- assembles S n . 

Proof. In [5], the authors study the variant of the minimum tile set problem restricted to squares 
where the temperature is fixed at r = 2. They use the following argument to show the problem is 
solvable in polynomial time. For all n € N, let S n = {0, 1, . . . ,n — l} 2 denote the n x n square. 
Adleman, Cheng, Goel, and Huang [3] showed that for all n 6 N, there is a TAS with at most 

log log n ) types that uniquely self-assembles S n . The proof of [5] first shows by a simple 
counting argument that there are at most a polynomial in n number of temperature-2 TAS's with 
at most O ( fog fog n ) tile types, using the fact that all strengths may without loss of generality be 
assumed to be 0, 1, or 2. They then make use of a polynomial-time algorithm Unique-Shape 
devised in the same paper [5] that, given any shape S and any TAS T, determines whether T 
uniquely self-assembles S. Finding the minimum TAS for S n then amounts to iterating over every 
"small" (Q( 10 gfog n ) tile types) TAS T and using the algorithm Unique-Shape to check which of 
these systems assemble S n . The upper bound of [3] guarantees that Unique-Shape will report a 
positive answer for at least one of these systems. 

Let c 6 Z + be the constant, shown to exist in [3], such that, for all n £ N, some tile system 
with at most c log nj log log n tile types uniquely self-assembles S n . Let k = c log n/ log log n. 
Enumerate each strength-free TAS with at most k tile types and test whether the strength-free 
system uniquely self-assembles S n . The algorithm Unique-Shape of [5] can be executed just as 
easily with a strength- free TAS as with a standard TAS, so Unique- Shape may be used for this 
test. 

Proposition 4.1 tells us that the number of strength-free TAS's with at most k tile types is at 
most 168 k k 4k+2 . We have that 168 fc < i68 clo s n / lo s lo s n < 168 clogn < (2 lo g") cl °s 168 < n 8c , and 

k 4k+2 < fc 2 (clogn/loglogn) 4cl °s n / lo s lo s n 

^logCc log n l l°g log n ) ^4c log n/ log log n 

^2'°§ c +l°g log n ~ log log log fi\Ac log n/ log logn 

jl.2 log n(log c+log log n— log log log n) / log log n 
jJ2 ^log *^4c(log c+log log n— log log log n) / log log n 

^,2^4c log c/ log log n+4c— 4c log log log n/ log log n 
< n 4clogc+4c+2^ 

whence the number of strength-free TAS's to search is at most n 4c log c + 12c + 2 = poly(ra). Once we 
obtain a smallest such strength-free TAS %f, we run the algorithm described in Theorem 3.1 to 
obtain a minimal standard TAS T that uniquely self-assembles S n . □ 

We note that while we have stated the theorem for the family of square shapes, our method, 
as well as that of [5], works for any family of shapes Si, S2, ■ ■ ■ where \S n \ = poly(n) and the tile 
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complexity of S n is at most 0( lo °^ n ). This includes, for instance, the family {T\,T2, ■ ■ .}, where 
T n is a width-n right triangle, and for each q € Q + the family {Rq t \, Rq,2, ■ ■ where i? 9jn is the 
n x \qn\ rectangle. 

5 Bounds on Temperature Relative to Number of Tile Types 

This section shows two bounds relating the number of tile types in a TAS to its temperature. The 
first bound, Theorem 5.1, shows that there are TAS's that require temperature exponential in the 
number of tile types (in the sense of local equivalence as defined in Section 3), if any combination 
of sides may be used for binding. This result can be interpreted to mean that the algorithm of [3] 
to find the minimum temperature-2 TAS for assembling an n x n square, which searches over all 
possible assignments of strengths to the glues, cannot be extended in a straightforward manner 
to handle larger temperatures, which is why it is necessary for the algorithm of Theorem 4.2 
to "shortcut" through the behaviors of tile types rather than enumerating strengths. The second 
bound, Theorem 5.2, on the other hand, shows that if we restrict attention to those (quite prevalent) 
classes of tile systems that use only one or two sides of tiles to bind, then linear temperature always 
suffices. 

5.1 Tile Assembly Systems Requiring Temperature Exponential in Number of 
Tile Types 

In this section, we prove that a temperature that is exponential in the number of tile types given 
by Theorem 3.1 is optimal, although there is a gap between the exponents (2l T l/ 4 for Theorem 5.1 
below versus 0(2 6 I T I) for Theorem 3.1). 

Theorem 5.1. For every n 6 Z + , there is a TAS T = (T, a, g, r) such that \T\ = An and for every 
TAS T' = (T,cr, g' ,t') that is locally equivalent to T , r' > 2 n . 

Proof. The tile set T is shown in Figure l. 9 In each stage, each of the top two light tile types 
represents a triple (a pair in stage 1) of glues whose sum is at least r. Each of the bottom two dark 
tiles represents a triple of glues whose sum is less than r. For the dark tile types to be nontrivial, 
we could imagine that the (unlabeled) north glue is strong enough to cooperate with some of the 
other glues. The actual strengths of the glues are left as variables, but the caption of Figure 1 gives 
one example of strengths that would satisfy the inequalities that the tile types represent. 

We prove by induction on n that A" n > A n + 2 n . For the base case, in the first stage, the top 
light tile type and top dark tile type enforce that A' 1 + B[ > r > A\ + B[, so A± < A^. Similarly, the 
bottom light tile type and the bottom dark tile type enforce that A'-y < A'{. Therefore A'[ > A\ + 2. 

For the inductive case, assume that A" n _ x > A n _\ + 2™" 1 . The top dark tile type enforces that 
r > A^_ x + A n + B' n , and by the induction hypothesis, A' n _ x + A n + B' n > A n _i + 2 n ~ l + A n + B' n . 
The top light tile type enforces that A' n + A ra _i + B' n > r, which combined with the previous two 
inequalities shows that A' n > 2 n_1 + A n . A similar analysis with the bottom light tile type and 
bottom dark tile type shows that A n > 2 n ~ l + A' n , whence A' n > 2 n ~ x + 2™" 1 + A n , establishing 
the inductive case. 

9 We do not specify the seed assembly since we are concerned only with the local behavior of the tiles. To make 
the local equivalence nontrivial, we would need to add a small number of tile types to the TAS to ensure that each 
tile shown is actually attachable at some point during assembly. However, but this would not affect the asymptotic 
size of the tile set as n — > oo, so the exponential lower bound on the temperature would still hold. 
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Figure 1: A set of tile types requiring temperature that is exponentially larger than the number of tile 
types. There are stages 1, 2, . . . , n, with stage i containing 4 tiles, and stage i ensuring that the gap between 
the largest and smallest strength in the stage is at least 2\ In each stage, each of the top two light tiles 
represents a triple (a pair in stage 1) of glues whose sum is at least r. Each of the bottom two dark tiles 
represents a triple of glues whose sum is less than r. The inequalities are satisfiable, for instance, by setting 
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Since it can be assumed without loss of generality that strengths are at most r, this shows that 
the tile set consisting of n stages, having \T\ = An tile types, requires r > 2™ to be realized. □ 

5.2 Temperature Linear in the Number of Tile Types Suffices for 2-Cooperative 
Equivalence 

Theorem 5.1 shows that temperature exponentially larger than the number of tile types is sometimes 
necessary for a TAS's behavior to be realized by integer strengths. However, the definition of local 
equivalence assumes that all possible combinations of sides of a tile type may be present in an 
assembly. Many TAS's are more constrained than this. There is a wide class of TAS's that we 
term 2- cooperative, meaning that all binding events during all assembly sequences use only 1 or 2 
sides that bind with positive strength. Nearly all theoretical TAS's found in the literature are 2- 
cooperative (indeed, temperature 2 systems by definition cannot require three sides to be present, 
although the model allows tile attachments with excess strength). In this section we show that 
the 3-cooperativity of Figure 1 is necessary, by showing that 2-cooperative systems can always be 
realized by strengths linear in the number of tile types. 

Theorem 5.2. Let T = (T,a,g,r) be a TAS, and let T> g ^{t) C D g j(t) be the cooperation set 
of t with respect to g and r restricted to containing only subsets of {N,S, E,W} of cardinality 1 
or 2. Then there is a TAS T = (T,a, g' ,r') with t 1 < 2\T\ + 2 such that, for each t 6 T, 

vfXt) = v { g %,{t). 
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That is, T' is equivalent in behavior to T, so long as all attachments involve only 1 or 2 sides. 



Proof. Let K = \T\. Let A(T) denote the set of all glue labels on tile types in T. Let G = 
{ g(a) | a £ A(T) } \ {0, r}. That is, G is the set of all positive but insufficient glue strengths used 
in this system. |G| < 2K since there are at most \T\ north-south glues and at most \T\ east- west 
glues. 

We split G into two subsets L = { g G G | < g < r/2 } and H = { g e G \ r/2 < g < r }. 
Let L = {£i,£2, ■ ■ ■ , £n} such that t\ < £2 < ■ ■ ■ < £ n < t/2, and let H = {hi, /12, . . . , h m } with 
r/2 < h\ < hi . . . h m < r. For descriptive purposes, define £0 = and £ n +i = t/2 (although these 
numbers may not be glue strengths). 

We aim at designing an algorithm to find a glue function g' satisfying: 

1. for any label a € {di, . . . , o~2k}, 5(c) > t <^=^ g'{o~) >2n + 2; 

2. for any pair of labels {a, a'} C {o~i, 02k}-, g(o~) + 5 r ( (J ') > T g'{o-) + g'{o') >2n + 2. 

Then for r' = 2n + 2, the TAS T = (T, a, g' ', r') satisfies the 2-cooperative equivalence with T that 
we seek. 

First, we define an equivalence relation = on H defined as: for h, h' 6 H , h = h' if (V 1 < i < 
n){h-\-£i > t <J=^> h' + £i > r) . This partitions -fT into subsets H n+ i , . . . , i/2 5 -Hi such that /i S 
if and only if r — £j < h < r = £j-i- In other words, h € Hj if /i + £j > r implies i > j. 

A glue function g' : A(T) — > N is defined as follows: for a label a £ A(T), 

r 0, if = 0; 

, = I i, if g{a) = 4 (1 < i < n); 

9{) ] 2n + 2-i, if 5 (a) G ^ (l<j<n + l); 

[ 2?i + 2, if r < g(o-). 

It is trivial that this satisfies condition (1) above. Let a, a' £ A(T) with < g(a) < g(cr') < r. 
There are two cases. 

1. Suppose that r < g(a) + g(a'). Note that r/2 < g(o-'). If r/2 < g(a), then by definition 
n + 1 < g'(cr) and n + 1 < g'{o~'), whence their sum it at least 2n + 2. If g(o) < r/2, then let 
g(a) = £i for some 1 < i < n. Since r < g(a) + g(cr'), this means that r = £i < g(o~'), which 
in turn implies g(o~') € Hi U U . . . U iJi. By definition, g'(o~) = i and g'(cr') > 2n + 2 — i. 
Consequently, 2n + 2 < p'(<r) + g'{cr'). 

2. Suppose that g{o~)+ g{a') < r. In this case, g(<r) < r/2. If g(a') < r/2, then by the definition 
of g' , both g(o~) and g(o~') are at most n so that their sum cannot reach 2n + 2. Otherwise, 
let g(a) = 1%. An argument similar to the one above gives g(cr') G H n+ i U . . . U Hj+i, whence 
g'(a') < 2n + 2 - i. Thus, g'(a) + g'{a') <2n + 2. 

This verifies that g' satisfies condition (2). □ 

6 Open Questions 

Our polynomial-time algorithm for finding the minimal tile system to self-assemble a square made 
crucial use of our polynomial-time algorithm that, given a strength- free tile system 7lf, finds 
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strengths and a temperature to implement a locally equivalent TAS T, or reports that none exists. 
An open question is whether there is a polynomial-time algorithm that, given a strength- free TAS 
T s i, outputs a TAS of minimal temperature that is locally equivalent to 7if, or reports that none 
exists. In the proof of Theorem 3.1, we expressed the problem as an integer linear program, but 
needed only to find an integer feasible solution that is not necessarily optimal. If we instead wanted 
to find an optimal solution to the program, it is not clear whether the problem restricted to such 
instances is NP-hard. 

The next question is less formal. Our results relating to 3-cooperative and 2-cooperative systems 
(Theorems 5.1 and 5.2, respectively), show that there is a difference in self-assembly "power" 
between these two classes of systems when we consider two tile systems to "behave the same" if 
and only if they are locally equivalent, which is a quite strict notion of behavioral equivalence. 
For example, two tile systems could uniquely self-assemble the same shape even though they have 
different tile types (hence could not possibly be locally equivalent). It would be interesting to 
know to what extent 3-cooperative systems - or high temperature tile systems in general - are 
strictly more powerful than temperature 2 systems under more relaxed notions of equivalence. For 
example, it is not difficult to design a shape that can be uniquely self-assembled with slightly fewer 
tile types at temperature 3 than at temperature 2. How far apart can this gap be pushed? Are 
there shapes with temperature-2 tile complexity that is "significantly" greater than their absolute 
(i.e., unrestricted temperature) tile complexity? 
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A Appendix: Formal Definition of the Abstract Tile Assembly 
Model 

This section gives a terse definition of the abstract Tile Assembly Model (aTAM, [41]). This is not 
a tutorial; for readers unfamiliar with the aTAM, [31] gives an excellent introduction to the model. 

Fix an alphabet X. X* is the set of finite strings over X. Given a discrete object O, {O) 
denotes a standard encoding of O as an element of X*. Z, Z + , N, M + denote the set of integers, 
positive integers, nonnegative integers, and nonnegative real numbers, respectively. For a set A, 
V(A) denotes the power set of A. Given i C Z 2 , the full grid graph of A is the undirected graph 
G { A = (V,E), where V = A, and for all u,v 6 V, {u, v} £ E \\u — v\\2 = 1; i.e., if and only 
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if u and v are adjacent on the integer Cartesian plane. A shape is a set S C Z 2 such that is 
connected. 

A tile type is a tuple t G (X*) 4 ; i.e., a unit square with four sides listed in some standardized 
order, each side having a grZue label (a.k.a. glue) I G E*. For a set of tile types T, let A(T) C X* 
denote the set of all glue labels of tile types in T. Let {N, S, E, W} denote the directions consisting 
of unit vectors {(0, 1), (0, —1), (1, 0), (—1, 0)}. Given a tile type t and a direction d G {N, S, E, W}, 
t(d) G A(T) denotes the glue label on t in direction d. We assume a finite set T of tile types, 
but an infinite number of copies of each tile type, each copy referred to as a tile. An assembly 
is a nonempty connected arrangement of tiles on the integer lattice Z 2 , i.e., a partial function 
a : I? — > T such that C?d om Q is connected and dom a ^ 0. The s/mpe o/ a is dom a. Given two 
assemblies a, f3 : Z 2 — > T, we say a is a subassembly of /3, and we write a C /3, if dom a C dom /3 
and, for all points p G dom a, ct(p) = 

A strength function is a function g : A(T) — > N indicating, for each glue label £, the strength 
g(£) with which it binds. Let a be an assembly and let p G dom a and <i G {N, S, E, W} such that 
p + d G dom a. Let i = a(p) and t! = a(p + d). We say that the tiles t and i' at positions p 
and p + d interact if £(cf) = t'{—d) and g(t(d)) > 0, i.e., if the glue labels on their abutting sides 
are equal and have positive strength. Each assembly a induces a binding graph G\, a grid graph 
G = (V a ,E a ), where V a = dom a, and {pi,P2} G E a a(pi) interacts with a(p2) 10 Given 

r G Z + , a is r -stable if every cut of G\ has weight at least r, where the weight of an edge is the 
strength of the glue it represents. That is, a is r-stable if at least energy r is required to separate 
a into two parts. When r is clear from context, we say a is stable. 

A tile assembly system (TAS) is a triple T = (T,a,g,r), where T is a finite set of tile types, 
a : Z 2 — » T is the finite, r-stable seed assembly, g : A(T) — )• N is the strength function, and 
r G Z + is the temperature. Given two r-stable assemblies a, (3 : Z 2 — - > T, we write a j3 if 
a C /3 and |dom /? \ dom a| = 1. In this case we say a T -produces f3 in one step. 11 If a — y[ (3, 
dom P \ dom a = {p}, and t = f3(p), we write (3 = a + (p \-> t). The T -frontier of a is the set 
d T a = Ua^/? dom /3 \ dom a, the set of empty locations at which a tile could stably attach to a. 

A sequence of k G Z + U {00} assemblies a = (ao,ai, • • •) is a T -assembly sequence if, for all 
1 < % < k, —t-^ aj. We write a — ^ /3, and we say a T -produces (3 (in or more steps) if there 
is a T-assembly sequence a = (oq, a±, . . .) of length k = |dom f3 \ dom a\ + 1 such that 1) a = a®, 
2) dom (3 = Uo<i<fc dom a^, and 3) for all < i < k, ai Q (3. In this case, we say that f3 is the 
result of a, written (3 = res(a). If k is finite then it is routine to verify that res(a) = a&_i. 12 We 
say a is T -producible if a -4^ a, and we write A[T] to denote the set of T-producible canonical 
assemblies. The relation — is a partial order on A[T] [?,?]. 13 A T-assembly sequence ao, a±, . . . 
is fair if, for all i and all p G d^~cti, there exists j such that ctj(p) is defined; i.e., no frontier location 
is "starved". 

10 For G f dolna = (V doma ,E domol ) and G h a = (V a ,E a ), G* is a spanning subgraph of G dom a : V a = Vd om a and 

E a C Ed om a ■ 

11 Intuitively a j3 means that a can grow into /3 by the addition of a single tile; the fact that we require both a 
and P to be r-stable implies in particular that the new tile is able to bind to a with strength at least r. It is easy to 
check that had we instead required only a to be r-stable, and required that the cut of /? separating a from the new 
tile has strength at least r, then this implies that /3 is also r-stable. 

12 If we had defined the relation — > T based on only finite assembly sequences, then — y r would be simply the reflexive, 
transitive closure (—^7")* °f — ■ But this would mean that no infinite assembly could be produced from a finite 
assembly, even though there is a well-defined, unique "limit assembly" of every infinite assembly sequence. 

13 In fact it is a partial order on the set of r-stable assemblies, including even those that are not T-producible. 
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An assembly a is T -terminal if a is r-stable and d^a = 0. It is easy to check that an assembly 
sequence a is fair if and only res(a) is terminal. We write -4n[T] Q A\T\ to denote the set of 
T-producible, T-terminal canonical assemblies. 

A TAS T is directed (a.k.a., deterministic, confluent) if the poset (.4[T] , — is directed; i.e., 
if for each a,j3E A[T], there exists 7 € A[T] such that a — 7 and (3 — 7. 14 We say that a TAS 
T strictly self-assembles a shape S C Z 2 if, for all a € *4o[T], dom a = S; i.e., if every terminal 
assembly produced by T has shape S. If T strictly self-assembles some shape S, we say that T is 
strict. Note that the implication "T is directed T is strict" holds, but the converse does not 
hold. We say that T uniquely self- assembles a shape S if T is directed and it strictly self-assembles 
S. 

When T is clear from context, we may omit T from the notation above and instead write — >i, 
— >, da, frontier, assembly sequence, produces, producible, and terminal. We also assume without 
loss of generality that every single glue or double glue occurring in some tile type in some direction 
also occurs in some tile type in the opposite direction, i.e., there are no "effectively null" single or 
double glues. 



14 The following two convenient characterizations of "directed" are routine to verify. T is directed if and only if 
^□[T]| = 1. T is not directed if and only if there exist a,/3 £ A[T] and p £ dom a n dom /3 such that a(p) 7^ /3(p)- 
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